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The Biotechnology Systems Branch of the Scientific and Technical Informatieif 1 ^ 
Center (STIC) detected errors when processing the following computer readable 
form: 



m 
D 



Application Serial Number: 09/ pay. £q*z 



Source: 



Date Processed by STIC: 



6"- /- o/ 



RECEIVED 

SEP 0 9 MOZ 

1ECH CENTER UN* 0 



THE ATTACHED PRINTOUT EXPLAINS DETECTED ERRORS. 

PLEASE FORWARD THIS INFORMATION TO THE APPLICANT BY EITHER: 

1) INCLUDING A COPY OF THIS PRINTOUT IN YOUR NEXT COMMUNICATION TO THE 
APPLICANT, WITH A NOTICE TO COMPLY or, 

2) TELEPHONING APPLICANT AND FAXING A COPY OF THIS PRINTOUT, WITH A 
NOTICE TO COMPLY 

FOR CRF SUBMISSION QUESTIONS, PLEASE CONTACT MARK SPENCER, 703-308-4212. 



FOR SEQUENCE RULES INTERPRETATION, PLEASE CONTACT ROBERT WAX, 703-308-4216. 
PATENTIN 2.1 e-mail help: natin21hclo@usnto,gov or phone 703-306-4119 (R. Wax) 
PATENTIN 3.0 c-mail help: natin3hcln@uspto.gov or phone 703-306-41 19 (R. Wax) 

TO REDUCE ERRORED SEQUENCE LISTINGS, PLEASE USE THE CHECKER 
VERSION 3.0 PROGRAM , ACCESSIBLE THROUGH THE U.S. PATENT AND 
TRADEMARK OFFICE WEBSITE. SEE BELOW: 



Checker Version 3.0 

The Checker Version 3.0 application is a state-of the-art Windows based software program 
employing a logical and intuitive user-interface to check whether a sequence listing is in 
compliance with format and content rules. Checker Version/3.0 works for sequence listings 
generated for the original version of 37 CFR §§1.821 - 1<825 effective October I, 1990 (old* 
rules) and the revised version (new rules) effective July 1, 1998 as well as World Intellectual 
Property Organization (WIPO) Standard ST.25. 

Checker Version 3.Q replaces the previous DOS-based version of Checker, and is Y2K- 
compliant. Checker allows public users to check sequence listings in Computer Readable form 
(CRF) before submitting them to the United States Patent and Trademark Office (USPTO). 
Use of Checker prior to filing the sequence listing is expected to result in fewer errored sequence 
listings, thus saving time and money. 

Checker Version 3.0 can be down loaded from the USPTO website at the followine address: 

http://www.uspto.gov/web/offices/pac/checker 



v Sequence Listing Error ■hnmary 




ERROR DETECTED SUGGESTED CORRECTION 



ATTN: NEW RULES CASES: PLEASE DISREGARD ENGLISH -ALPHA" HEADERS, WHICH WERE INSERTED BY PTO SOFTWARE 
\/ Wrapped Nucleics The numberrtext at the end of each line \vrappe<p down to the next l i n o —. 

This may occur if your file was retrieved in a word processor after creating it. 
Please adjust your right margin to .3. as this will prevent trapping". 



Wrapped Aminos 



The amino acid numberAext at the end of each line "wrapped " down to the next line. 
This may occur if your file was retrieved in a word processor after creating K. 
Please adjust your right margin to .3, as this will prevent "wrapping". 



Incorrect Line Length The rules require that a line not exceed 72 characters in length. This includes spaces. _ 

Misaligned Amino Acid The numbering under each 5th amino acid is misaligned. This may be caused by the use of tabs 
Numbering between the numbering. It Is recommended to delete any tabs and use spacing between the numbers. 



Non-ASCII 



This file was not saved in ASCII (DOS) text, as require^ by the Sequence Rules. 

Please ensure your subsequent submission is saved irf ASCII text so that it can be processed. 



Variable Length 



Patentln ver. 2.0 "bug* 



Sequence (s) contain n's or Xaa's which represented more than one residue. 

As per the rules, each n or Xaa can only represent a single residue. 

Please present the maximum number of each residue having variable length and 

Indicate in the (ix) feature section that some may be missing. 

A "bug" |n Patentln version 2.0 has caused the <220>-<223> section to be missing from amino acid 

sequence(s) . Normally, Patentln would automatically generate this section from the 

previously coded nucleic acid sequence. Please manually copy the relevant <220>-<223> section 
to the subsequent amino acid sequence. This applies primarily to the mandatory <220>-<223> 

sections for Artificial or Unknown sequences. 



Skipped Sequences 
(OLD RULES) 



Sepuence(s) missing. If intentional, please use the following format for each skipped sequence: 

(2) INFORMATION FOR SEQ ID NO:X: 

(i) SEQUENCE CHARACTERISTICS:(Do not insert any headings under "SEQUENCE CHARACTERISTICS") 
(xi) SEQUENCE DESCRIPTION:SEQ ID NO:X: 
This sequence is intentionally skipped 



Skipped Sequences 
(NEW RULES) 



Please also adjust the "(iii) NUMBER OF SEQUENCES:" response to include the skipped sequence(s). 

Sequence(s) missing. If intentional, please use the following format for each skipped sequence. 

<210> sequence id number 
<400> sequence id number 
000 



Use of n's or Xaa's 
(NEW RULES) 



Use of n's and/or Xaa's have been detected in the Sequence Listing. 
Use of <220> to <223> is MANDATORY If n's or Xaa's are present. 

In <220> to <223> section, please explain location of n or Xaa. and which residue n or Xaa represents. 



Use of "Artificial" 
(NEW RULES) 

Use of <220>Feature 
(NEW RULES) 



Patentln ver. 2.0 "bug" 



Use of "Artificial" only as "<213> Organism" response is incomplete, per 1 .823(b) of New Sequence Rules. 
Valid response is Artificial Sequence. 

Sequence(s) are missing the <220>Feature and associated headings. 

Use of <220> to <223> is MANDATORY if <21 3>ORGANISM is "Artificial Sequence" or "Unknown" 
Please explain source of genetic material in <220> to <223> section. 

(See "Federal Register. - 6/01/98, Vol. 63, No. 104, pp. 29631-32) (Sec. t .823 of new Rules) 

Please do not use "Copy to Disk" function of Patentln version 2.0. This causes a corrupted 

file, resulting in missing mandatory numeric. identifiers and responses (as indicated on raw sequence listing). 

Instead, please use "File Manager" or any other means to copy file to floppy disk. 
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RAW SEQUENCE LISTING 
PATENT APPLICATION: US/09/807,802 

Input Set : A:\sequencelist.txt 

Output Set: N:\CRF3\05012001\I807802.raw 



DATE: 05/01/2001 
TIME: 11:39:02 



<110> APPLICANT: Wilson, James M. 
Xiao, Weidong 

The Trustees of the University of Pennsylvania 

TITLE OF INVENTION: Adeno-Associated Virus Serotype I Nucleic Acid 
Sequences, Vectors and Host Cells Containing Same 
FILE REFERENCE: GNVPN . 031PCT 
CURRENT APPLICATION NUMBER: US/09/807 , 802 
CURRENT FILING DATE: 2001-04-16 v-' 
PRIOR APPLICATION NUMBER: 60/107,114 
17 <151> PRIOR FILING DATE: 1998-11-05 
19 <160> NUMBER OF SEQ ID NOS: 20 
21 <170> SOFTWARE: Patentln Ver . 2.0 



4 

5 
6 

8 <120> 
9 

11 <130> 

13 <140> 

14 <141> 

16 <150> 



- 2 



ERRORED SEQUENCES 



23 <210> SEQ ID NO: 1 

24 <211> LENGTH: 4718 



25 <212> 

26 <213> 

28 <220> 

29 <221> 
<222> 
<220> 
<221> 
<222> 
<400> 



30 
32 
33 
34 
36 
37 
40 
43 



TYPE: DNA 
ORGANISM: 
FEATURE : 
NAME/KEY : 
LOCATION: 
FEATURE : 
NAME/KEY 
LOCATION 
SEQUENCE 



AAV-1 

CDS 
(335) 



(2206) 



CDS 

(2223) .. (4430) 
1 

ttgcccactc cctctctgcg cgctcgctcg ctcggtgggg cctgcggacc aaaggtccgc 60 
agacggcaga gctctgctct gccggcccca ccgagcgagc gagcgcgcag agagggagtg 120 
ggcaactcca tcactagggg taatcgcgaa gcgcctccca cgctgccgcg tcagcgctga 180 



46 cgtaaattac gtcatagggg agtggtcctg tattagctgt cacgtgagtg cttttgcgac 240 
49 attttgcgac accacgtggc catttagggt atatatggcc gagtgagcga gcaggatctc 300 
52 cattttgacc gcgaaatttg aacgagcagc agcc atg ccg ggc ttc tac gag ate 3 55 



54 

55 




















Met Pro Gly Phe Tyr Glu He 




57 


gtg 


ate 


aag 


gtg 


ccg 


age 


gac 


ctg 


gac 


gag 


cac 


ctg 


ccg 


ggc 


att 


tct 


403 


59 


Val 


He 


Lys 


Val 


Pro 


Ser 


Asp 


Leu 


Asp- 


Glu 


His 


Leu 


Pro 


Gly 


He 


Ser 




60 






10 










15 










20 










62 


gac 


teg 


ttt 


gtg 


age 


tgg 


gtg 


gec 


gag 


aag 


gaa 


tgg 


gag 


ctg 


ccc 


ccg 


451 


64 


Asp 


Ser 


Phe 


Val 


Ser 


Trp 


Val 


Ala 


Glu 


Lys 


Glu 


Trp 


Glu 


Leu 


Pro 


Pro 




65 




25 










30 










35 












67 


gat 


tct 


gac 


atg 


gat 


ctg 


aat 


ctg 


att 


gag 


cag 


gca 


ccc 


ctg 


acc 


gtg 


499 


69 


Asp 


Ser 


Asp 


Met 


Asp 


Leu 


Asn 


Leu 


He 


Glu 


Gin 


Ala 


Pro 


Leu 


Thr 


Val 




70 


40 










45 










50 










55 




72 


gec 


gag 


aag 


ctg 


cag 


cgc 


gac 


ttc 


ctg 


gtc 


caa 


tgg 


cgc 


cgc 


gtg 


agt 


547 


74 


Ala 


Glu 


Lys 


Leu 


Gin 


Arg 


Asp 


Phe 


Leu 


Val 


Gin 


Trp 


Arg 


Arg 


Val 


Ser 




75 










60 










65 










70 
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RAW SEQUENCE LISTING 

PATENT APPLICATION: . US/09/807 , 802 



DATE: 0 5/01/2001 
TIME: 11^39:02 



Input Set : A:\seqiiencelist.txt 

Output Set: N:\CRF3\O5012001\I807802.raw 



E--> 



E--> 



E--> 



E--> 



E--> 



E--> 



77 

79 

80 

82 

84 

85 

87 

89 

90 

92 

94 

95 

97 

99 

100 

102 

104 

105 

107 

109 

110 

112 

114 

115 

117 

119 

120 

122 

123 

124 

125 

127 

128 

129 

130 

132 

133 

134 

135 

137 

138 

139 

140 

142. 

143 

144 

145 

147 

148 



aag gcc ccg gag gcc. etc ttc 
Lys Ala Pro Glu Ala Leu Phe 

75 

tac ttc cac etc cat att ctg 
Tyr Phe His Leu His lie Leu 
90 

gtg ctg ggc cgc ttc ctg agt 
Val Leu Gly Arg Phe Leu Ser 
105 110 
ate tac cgc ggg ate gag ccg 
lie Tyr Arg Gly lie Glu Pro 
120 125 
aag acg cgt aat ggc gcc gga 
Lys Thr Arg Asn Gly Ala Gly 

140 

ccc aac 
Pro Asn 
155 
act aac 
Thr Asn 
170 

gag cgc aaa egg etc gtg gcg 
Glu Arg Lys Arg Leu Val Ala 
185 190 
cag gag cag aac aag gag*aat 
Gin Glu Gin Asn Lys Glu Asn 
200 205 

ate egg tea aaa ace tec gcg 
1027 — ■ 

lie Arg Ser Lys Thr Ser Ala 

220 

gtg gac egg ggc ate ace tec 

1075 



tac ate 
Tyr lie 



gcg 
Ala 



tgg 
Trp 



tac 
Tyr 

atg 
Met 



etc 
Leu 

gag 
Glu 



ctg 
Leu 

gag 
Glu 



ttt gtt cag ttc gag aag ggc gag tec 595 
Phe Val Gin Phe Glu Lys Gly Glu Ser 

80 85 
gtg gag ace acg ggg gtc aaa tec atg 64 3 
Val Glu Thr Thr Gly Val Lys Ser Met 

95 100 
cag att agg gac aag ctg gtg cag ace 691 
Gin lie Arg Asp Lys Leu Val Gin Thr 

' 115 

ace ctg ccc aac tgg ttc gcg gtg acc 739 
Thr Leu Pro Asn Trp Phe Ala Val Thr 

130 135 
999 999 aac aag gtg gtg gac 'ga^f tgc 787 
Gly Gly Asn Lys Val Val Asp Glu Cys 
145 150 
ccc aag act cag ccc gag ctg cag tgg 835 
Pro Lys Thr Gin Pro Glu Leu Gin Trp 

160 165 
tat ata age gcc tgt ttg aac ctg gcc 883 
Tyr lie Ser Ala Cys Leu Asn Leu Ala 
175 180 

cag cac ctg acc cac gtc age cag acc 931 
Gin His Leu Thr His Val Ser Gin Thr 

195 

ctg aac ccc aat tct gac gcg cct gtc 979 
Leu Asn Pro Asn Ser Asp Ala Pro Val 

210 215 

cgc tac atg gag ctg gtc ggg tgg ctg 

^1 



Arg Tyr Met Glu Leu Val Gly Trp Leu 

225 230 

gag aag cag tgg ate cag gag gac cag 
— _ — ^ 




Val Asp Arg Gly lie Thr Ser 

235 

gcc teg tac ate tec ttc aac 

1123 

Ala Ser Tyr lie Ser Phe Asn 
250 

aag gcc get ctg gac aat gcc 

1171 

Lys Ala Ala Leu Asp Asn Ala 
265 270 
gcg ccc gac tac ctg gta ggc 

1219 

Ala Pro Asp Tyr Leu Val Gly 
280 285 
aac cgc ate tac cgc ate ctg 
1267 ■ 



Glu Lys Gin Trp lie Gin Glu Asp Gin 

240 245 
gcc get tec aac teg egg tec cag ate 



Ala Ala Ser Asn Ser Arg Ser Gin lie 
255 260 



Gly Lys lie Met Ala Leu Thr Lys Ser 

2.75 

ccc get ccg' ccc gcg gac att aaa acc^ 



Pro Ala Pro Pro Ala Asp lie Lys Thr 

290 1 295 
gag ctg aac ggc tac gaa cct gcc tac 
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RAW SEQUENCE LISTING DATE: 05/01/2001 

PATENT APPLICATION: . US/09/807 , 802 TIME: 11^:39:02 



Input Set : A:\sequencelist.txt 

Output Set: Nj\CRF3\05012001\I807802.raw 

14 9 Asn Arg lie Tyr Arg lie Leu Glu Leu Asn Gly Tyr Glu Pro Ala Tyr 
150 300 305 310 

E--> 152^gcc gac tec gtc ttt etc ggc tgg gcc cag aaa agg ttc ggg aag cgc 
153V1315j> 

154 Ala Gly Ser Val Phe Leu Gly Trp Ala Gin Lys Arg Phe Gly Lys Arg 

155 315 320 325 

E--> 157 ajic.acc ate tgg ctg ttt ggg ccg gcc acc acg ggc aag acc aac ate 

158 ^363 J 

159 Asn Thr lie Trp Leu Phe Gly Pro Ala Thr Thr Gly Lys Thr Asn lie 

160 330 335 340 

E--> 162 acg gaa gcc ate gcc cac gcc gtg ccc ttc tac ggc tgc gtc aac tgg 

163( l41j ^) 

164 AlaGlu Ala He Ala His Ala Val Pro Phe Tyr Gly Cys Val" AsTn Trp 

165 345 350 355 

E--> 167 acc aat gag aac ttt ccc ttc aat gat tgc gtc gac aag atg gtg ate 
168 /I45j y 

169 Thr Asn Glu Asn Phe Pro Phe Asn Asp Cys Val Asp Lys Met Val He 

170 360 365 370 375 
E--> 172 tgg tgg gag gag ggc aag atg acg gcc aag gtc gtg gag tec gcc aag 

173(1507^/ 

174 Trp"Trp Glu Glu Gly Lys Met Thr Ala Lys Val Val Glu Ser Ala Lys 

175 380 385 390 
E--> 177 gcc att etc ggc ggc age aag gtg cgc gtg gac caa aag tgc aag teg 

178 Q5S5J 

179 Ala He Leu Gly Gly Ser* Lys Val Arg Val Asp Gin Lys Cys Lys Ser 

180 395 400 405 

E--> 182 tec gcc cag ate gac ccc acc ccc gtg ate gtc acc tec aac acc aac 
183(l603y 

184 "SefAla Gin He Asp Pro Thr Pro Val He Val Thr Ser Asn Thr Asn 

185 410 415 420 

E--> 187 atg tac gcc gtg att gac ggg aac age acc acc ttc gag cac cag cag 

1 8 8V1 6 5 l^J 

189"Met"^Cys Ala Val He Asp Gly Asn Ser Thr Thr Phe Glu His Gin Gin 
190 425 430 435 

E--> 192 ccg t^tg cag gac egg atg ttc aaa ttt gaa etc acc cgc cgt ctg gag 

193^1699 / 

194 >r6 Leu Gin Asp Arg Met Phe Lys Phe Glu Leu Thr Arg Arg Leu Glu 

195 440 445 450 455 
E--> 197 cat gac ttt ggc aag gtg aca aag cag gaa gtc aaa gag ttc ttc cgc 

198Ql747j) 

199 HTs~~Asp Phe Gly Lys Val Thr Lys Gin Glu Val Lys Glu Phe Phe Arg 

200 460 465 470 
E--> 202 tgg gcg cag gat cac gtg acc gag gtg gcg cat gag ttc tac gtc aga 

20£ 1795 ; ■ 

204 Trp* Ala Gin Asp His Val Thr Glu Val Ala His Glu Phe Tyr Val Arg 

205 475 480 485 

E--> 207 aag ggt gga gcc aac aaa aga ccc gcc ccc gat gac gcg gat aaa age 
208 ^1843 J 

209 Lys Gly Gly Ala Asn Lys Arg Pro Ala Pro Asp Asp Ala Asp Lys Ser 
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RAW SEQUENCE LISTING DATE: 05/01/2001 

PATENT APPLICATION: US/09/807,802 TIME: 11:39:02 



Input Set : A:\sequencelist.txt 

Output Set: N?\CRF3\05012001\I807802.raw 



210 490 495 500 

E--> 212 jjag -ccc aag egg gec tgc ccc tea gtc gcg gat cca teg acg tea gac 
21*1891^ 

21^t?*u Pro Lys Arg Ala Cys Pro Ser Val Ala Asp Pro Ser Thr Ser Asp 
215 505 510 515 

E--> 217 gcg gaa gga get ccg gtg gac ttt gec gac agg tac caa aac aaa tgt 

218A939) 

219^Ta~ Glu Gly Ala Pro Val Asp Phe Ala Asp Arg Tyr Gin Asn Lys Cys 
220 520 525 530 535 

E--> 222 tct cgt cac gcg ggc atg ctt cag atg ctg ttt ccc tgc aag aca tgc 

223 /198 7: 

224 ^Ser Arg His Ala Gly Met Leu Gin Met Leu Phe Pro Cys Lys Thr Cys 
225 540 545 155$' 

E--> 227 qag aga atg aat cag aat ttc aac att tgc ttc acg cac ggg acg aga 

228 (2035 J 

229 TrtTr"£rg Met Asn Gin Asn Phe Asn lie Cys Phe Thr His Gly Thr Arg 

230 555 560 565 
E--> 232 aac tgt tea gag tgc ttc ccc ggc gtg tea gaa tct caa ccg gtc gtc 

233^2083^ 

234^Asp^Cys Ser Glu Cys Phe Pro Gly Val Ser Glu Ser Gin Pro Val Val 
235 570 575 580 

E--> 237 a,ga aag agg acg tat egg aaa etc tgt gee att cat cat ctg ctg ggg 

238^2131 ^ 

239 ATg^Lys Arg Thr Tyr Arg Lys Leu Cys Ala lie His His Leu Leu Gly 

240 585 -590 595 

E - - > 24 2 egg get ccc gag att get tgc teg gee tgc gat ctg gtc aac gtg gac 

243 (2179J 

244 Tufg Ala Pro Glu He Ala Cys Ser Ala Cys Asp Leu Val Asn Val Asp 

245 600 605 610 615 
E--> 24 7 ctg gat x gac tgt gtt tct gag caa taa atgacttaaa ccaggt atg get gee 

248 ^231 ) 

249~TeuAsp Asp Cys Val Ser Glu Gin Met Ala Ala 

W--> 250 620 625 

E--> 252 qat gat tat ctt cca gat tgg etc gag gac aac etc tct gag ggc att 

2 53 /2 279 J 

2 54 Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Asn Leu Ser Glu Gly He 
W--> 255 630 635 640 

E--> 257 cgc gag tgg tgg gac ttg aaa cct gga gee ccg aag ccc aaa gee aac 

258(^2327 ) 

2 59 Arg Glu Trp Trp Asp Leu Lys Pro Gly Ala Pro Lys Pro Lys Ala Asn 
W--> 260 645 650 655 

E--> 262 cag caa aag cag gac gac ggc egg ggt ctg gtg ctt cct ggc tac aag 

263^2375 y 

264 Gin Gin Lys Gin Asp Asp Gly Arg Gly Leu Val Leu Pro Gly Tyr Lys 
W--> 265 660 665 670 675 

E--> 267 tac etc gga ccc ttc aac gga etc gac aag ggg gag ccc gtc aac gcg 

268 {2423 , 

269 Tyr Leu Gly Pro Phe Asn Gly Leu Asp Lys Gly Glu Pro Val Asn Ala 
W--> 270 680 685 690 
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RAW SEQUENCE LISTING DATE: 05/01/2001 

PATENT APPLICATION: . US/09/807,802 TIME: 11:39:02 



Input Set : A:\sequencelist.txt 

Output Set: N:\CRF3\05012001\I807802.raw 



E--> 272 acg gac gca gcg gcc etc gag cac gac aag gec tac gac cag cag etc* 

273 mnx) 

274 TCTaAsp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp Gin Gin Leu 
W--> 275 695 700 705 

E--> 277 aaa gcg ggt gac aat ccg tac ctg egg tat aac cac gcc gac gcc gag 
278^519^) 

279 Lys Ala Gly Asp Asn Pro Tyr Leu Arg Tyr Asn His Ala Asp Ala Glu 
W--> 280 710 715 720 

E--> 282 ttt- cag gag cgt ctg caa gaa gat acg tct ttt ggg ggc aac etc ggg 

283 /2567j) 

284 ^Phe Gin Glu Arg Leu Gin Glu Asp Thr Ser Phe Gly Gly Asn Leu Gly 
W--> 285 725 730 735 

E--> 287 cga gca gtc ttc cag gcc aag aag egg gtt etc gaa cct etc ^gt ctg 

288(261^) 

289 "Arg Ala Val Phe Gin Ala Lys Lys Arg Val Leu Glu Pro Leu Gly Leu 
W--> 290 740 745 750 755 

E--> 292 gtt gag gaa ggc get aag acg get cct gga aag aaa cgt ccg gta gag 

293/2663 ) 

2941 terr'Glu Glu Gly Ala Lys Thr Ala Pro Gly Lys Lys Arg Pro Val Glu 
W--> 295 760 765 770 

E--> 297 cag teg cca caa gag cca gac tec tec teg ggc ate ggc aag aca ggc 

298(2711^) 

299 StffSer Pro Gin Glu Pro Asp Ser Ser Ser Gly lie Gly Lys. Thr Gly 
W--> 300 775 780 785 

E--> 302 cag cag ccc get aaa aagaga etc aat ttt ggt cag act ggc gac tea 

303 /2759 

304 6lrTl31n Pro Ala Lys Lys Arg Leu Asn Phe Gly Gin Thr Gly Asp Ser 
W--> 305 790 795 800 

E--> 307 aag tea gtc ccc gat cca caa cct etc gga gaa cct cca gca ace ccc 
308^ 2807 ' 

309 Glu^Ser Val Pro Asp Pro Gin Pro Leu Gly Glu Pro Pro Ala Thr Pro 
W--> 310 805 810 815 

E--> 312 act get gtg gga cct act aca atg get tea ggc ggt ggc gca cca atg 
313( 2855 ) 

314 Via" Ala Val Gly Pro Thr Thr Met Ala Ser Gly Gly Gly Ala Pro Met 
W--> 315 820 825 830 835 

E--> 317 gca gac aat aac gaa ggc gcc gac gga gtg ggt aat gcc tea gga aat 

318 6903J 

319^Ala Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Asn Ala Ser Gly Asn 
W--> 320 840 845 850 

E--> 322 tgg cat tgc gat tec aca tgg ctg ggc gac aga gtc ate ace acc age 

323Q951J 

324 Trp His Cys Asp Ser Thr Trp Leu Gly Asp Arg Val lie Thr Thr Ser 
W--> 325 855 860 865 

E--> 327 >acc cgc acc tgg gcc ttg ccc acc tac aat aac cac etc tac aag caa 

328(2999j; 

329 ThT" Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn His Leu Tyr Lys Gin 
W--> 330 870 875 880 

E--> 332 ate tec agt get tea acg ggg gcc age aac gac aac cac tac ttc ggc 
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RAW SEQUENCE LISTING 

PATENT APPLICATION: .US/09/807,802 



DATE: 05/01/2001 
TIME: 11:39:02 



W- 
E- 



W- 
E- 



W 

E 



W 

E- 



W- 
E 



E- 



W 

E- 



E- 



W 

E- 



W 

E- 



E- 



W- 
E- 



Input Set : A:\sequencelist.txt 
Output Set: N:\CRF3\05012001\I807802.raw 

33^3047 ) 

334^Tre— Ser Ser Ala Ser Thr Gly Ala Ser Asn Asp Asn His Tyr Phe Gly 
-> 335 885 890 * " 895 

-> 337 tatragc acc ccc tgg ggg tat ttt gat ttc aac aga ttc cac tgc cac 

338(3095^5 

339 Tyr Ser Thr Pro Trp Gly Tyr Phe Asp Phe Asn Arg Phe His Cys His 
-> 340 900 905 910 915 

-> 34 2 ttt tea cca cgt gac tgg cag cga etc ate aac aac aat tgg gga ttc 

34 3^3143^) 

34 4^Ph€ser Pro Arg Asp Trp Gin Arg Leu lie Asn Asn Asn Trp Gly Phe 
--> 345 920 925 930 

--> 34 7 egg ccc aag aga etc aac ttc aaa etc ttc aac ate caa gtc aag gag 
348^3191) v/ 
34 9 Svrg~"Pro Lys Arg Leu Asn Phe Lys Leu Phe Asn lie Gin Val Lys Glu 
■-> 350 935 940 945 

--> 3 52 gtc acg acg aat gat ggc gtc aca acc ate get aat aac ctt acc age 
353 (3239^ 

3 54 Val' Thr Thr Asn Asp Gly Val Thr Thr lie Ala Asn Asn Leu Thr Ser 
-> 355 950 955 960 

■-> 3 57 acg gtt caa gtc ttc teg gac teg gag tac cag ctt ccg tac gtc etc 

358 /S287J) 

359 Thr Val Gin Val Phe Ser Asp Ser Glu Tyr Gin Leu Pro Tyr Val Leu 
-> 360 965 970 975 

-> 3 62 / ^ggc tct gcg cac cag ggc tgc etc cct ccg ttc ccg gcg gac gtg ttc 

36/ 3335V : 

3 64~Gly Ser Ala His Gin Gly Cys Leu Pro Pro Phe Pro Ala Asp Val Phe 
-> 365 980 985 990 995 

-> 367 atg att ccg caa tac ggc tac ctg acg etc aac aat ggc age caa gee 

368 (3383J 

369^Metr'fle Pro Gin Tyr Gly Tyr Leu Thr Leu Asn Asn Gly. Ser Gin Ala 
-> 370 1000 1005 1010 

-> 3 72 gtg" gga cgt tea tec ttt tac tgc ctg gaa tat ttc cct tct cag atg 

373J3 43jJ7 

374 Val Gly Arg Ser Ser Phe Tyr Cys Leu Glu Tyr Phe Pro Ser Gin Met 
-> 375 1015 1020 1025 

-> 377 >crtg aga acg ggc aac aac ttt acc ttc age tac acc ttt gag gaa gtg 

378(^3479^7 

379 Ceu Arg Thr Gly Asn Asn Phe Thr Phe Ser Tyr Thr Phe Glu Glu Val 
-> 380 1030 1035 1040 

-> 3 82 ject ttpc cac age age tac gcg cac age cag age ctg gac egg ctg atg 

383/jlJ527.y 

384 Pro Phe His Ser Ser Tyr Ala His Ser Gin Ser Leu Asp Arg Leu Met 
-> 385 1045 1050 1055 

-> 38*V*aat cct etc ate gac caa tac ctg tat tac ctg aac aga act caa aat 

3 88 3575J) 

389"^s n Pro Leu lie Asp Gin Tyr Leu Tyr Tyr Leu Asn Arg Thr Gin Asn 
-> 390 1060 1065 1070 1075 

-> 392^-cag tec gga agt gee caa aac aag gac ttg ctg ttt age cgt ggg tct- 

39(3 3623 
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RAW SEQUENCE LISTING DATE: 05/01/2001 

PATENT APPLICATION: . US/09/807,802 TIME: 11:39:02 



Input Set : A:\sequencelist.txt 

Output Set: N:\CRF3\05012001\I807802.raw 



W 

E- 



W- 
E- 



W 

E- 



W 
E 



W 

E 



W 
E 



W 

E- 



W 

E- 



E- 



W 

E- 



E- 



E- 



394 Gin Ser Gly Ser Ala Gin Asn Lys Asp Leu Leu Phe Ser Arg Gly Ser 

-> 395 1080 1085 1090 

--> 397 jcea-gct ggc atg tct gtt cag' ccc aaa aac tgg eta cct gga ccc tgt 
398f 3671 ^ 

399"-Prtr*£la Gly Met Ser Val Gin Pro Lys Asn Trp Leu Pro Gly Pro Cys 

--> 400 1095 1100 1105 

--> 4 02 tai <Kjg cag cag cgc gtt tct aaa aca aaa aca gac aac aac aac age 

403 mis) 

4 04 Tyr Arg Gin Gin Arg Val Ser Lys Thr Lys Thr Asp Asn Asn Asn Ser 
--> 405 * 1110 1115 1120 

■-> 4 07 ajit..ttt ace tgg act ggt get tea aaa tat aac etc aat ggg cgt gaa 

408/1767) 

409 f Asn Phe Thr Trp Thr Gly Ala Ser Lys Tyr Asn Leu Asn Gly Ajfg Glu 
--> 410 1125 1130 1135 

--> 412 tec ate ate aac cct ggc act get atg gec tea cac aaa gac gac gaa 
413/1815 ) 

414 < *Ser--fie lie Asn Pro Gly Thr Ala Met Ala Ser His Lys Asp Asp Glu 
■-> 415 1140 1145 1150 1155 

--> 417 gac aag ttc ttt ccc atg age ggt gtc atg att ttt gga aaa gag age 

418 (863 ) 

419 Asp-'£ys Phe Phe Pro Met Ser Gly Val Met He Phe Gly Lys Glu Ser 
-> 420 1160 1165 1170 

•-> 422 gee gga get tea aac act gca ttg gac aat gtc atg att aca gac gaa 

423 (391^ 

424 Ala Gly Ala Ser Asn Thr"Ala Leu Asp Asn Val Met He Thr Asp Glu 
-> 425 1175 1180 1185 

-> 427 aag gaa att aaa gee act aac cct gtg gee ace gaa aga ttt ggg ace 

4 2 ^39 59 J 

429 Giu Glu He Lys Ala Thr Asn Pro Val Ala Thr Glu Arg Phe Gly Thr 
-> 430 1190 1195 1200 

-> 4 32 atg gca gtc aat ttc cag age age age aca gac cct gcg acc gga gat 

4 3 3(7007^ 

4 34 V51' Ala Val Asn Phe Gin Ser Ser Ser Thr Asp Pro Ala Thr Gly Asp 
-> 435 1205 1210 1215 

> 4 37 gtg -cat get atg gga gca tta cct ggc atg gtg tgg caa gat aga gac 



43fi( 4055^ 
4 39 Vail Hi 



His Ala Met Gly Ala Leu Pro Gly Met Val Trp Gin Asp Arg Asp 

> 440 1220 1225 1230 1235 

> 442 jgfcg tac ctg cag ggt ccc att tgg gee aaa att cct cac aca gat gga 
443(4103 ^) 

444 ^731 Tyr Leu Gin Gly Pro He Trp Ala Lys He Pro His Thr Asp Gly 

> 445 1240 1245 1250 

> 44 7 cac ttt cac ccg tct cct ctt atg ggc ggc ttt gga etc aag aac ccg 
44^ 415lJ) 

449 kis Phe His Pro Ser Pro Leu Met Gly Gly Phe Gly Leu Lys Asn Pro 

> 450 1255 1260 1265 

> 4 52x<rct cct cag ate etc ate aaa aac acg cct gtt cct gcg aat cct ccg 
4 53( 4199 ) 

4 54 Vro~" Pro Gin He Leu He Lys Asn Thr Pro Val Pro Ala Asn Pro Pro 
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RAW SEQUENCE LISTING DATE: 05/01/2001 

PATENT APPLICATION: * US/09/807 , 802 TIME: 11*:39:02 



Input Set : A:\sequencelist.txt 

Output Set: N:\CRF3\05012001\I807802.raw 



W- 
E- 



W- 
E- 



W- 
E- 



W- 
E- 



W- 
E- 



> 
> 



> 
> 



> 
> 



> 
> 



> 
> 



455 



1270 



1275 



457^gcg~gag ttt tea get aca aag ttt get tea ttc ate 
45^4247 N 



1280 
ace 



caa tac tec 




a Glu Phe Ser Ala Thr Lys Phe Ala 
1285 1290 

ga caa gtg agt gtg gaa att gaa 



Gly Gin Val 



465 1300 
467 age a~ag 



cgc tgg 



Ser Val 
1305 
aat ccc 



Glu He Glu 



gaa gtg cag 



Ser Phe He 
1295 
tgg gag ctg 

Trp Glu Leu 

1310 
tac aca tec 



Thr Gin Tyr Ser 



cag aaa gaa aac 



Gin Lys 



aat tat 



Glu Asn 
1315 
gca aaa 



468^4343 ^> 
4 69 fv Ser~"Lys Arg 




gee aac 



Trp Asn Pro 

1320 
gtt gat ttt 



Glu 



act 



Tyr 
1325 
gtg gac aac 



Thr Ser Asn 



aat gga ctt 



Tyr Ala 
tat act 



Lys 



gag 



Asn Val Asp Phe Thr Val Asp Asn Asn Gly Leu Tyr Thr Glu 

1335 1340 1345 

ccc att ggc ace cgt tac ctt acc cgt ccc ctg taattacgtg 



W- 
E- 

E 

E 

E 

E 



> 
> 

--> 
--> 



Val Gin 

470 
472 
473 
474 
475 

477 ocrt cgc 
478^4440. 

479 Pro Arg Pro He Gly Thr Arg Tyr Leu Thr Arg Pro Leu 

1350 1355 1360 

ttaa_tcaata aaccggttga ttcgtttcag ttgaactttg gtctcctgtc cttcttatct 

tateggttae catggttata gcttacacat taactgcttg gttgegctte gcgataaaag 

4 86/4 560 ) ' 

4 8 8 JHJClfacgtca tegggttace cctagtgatg gagttgecca ctccctctct gcgcgctcgc 
489(46^0-^ 

4 91 tegcteggtg gggectgegg accaaaggtc cgcagacggc agagctctgc tctgccggcc 



480 
482 

483 
485 



--> 4 91 tegetc* 
49^ 4680 ^> 
--> 49)r~e~caccg« 



495 
767 
768 
769 
770 



caccgagcg agegagegeg cagagaggga gtgggcaa 
4718 

SEQ ID NO: 4 
LENGTH: 1872 
TYPE: DNA 

AAV-1 



<210> 
<211> 
<212> 
<213> 

772 <220> 

773 <221> 

774 <222> 



ORGANISM: 
FEATURE: 
NAME/KEY 
LOCATION 



776 <400> SEQUENCE 

777 atg ccg ggc ttc 
Pro Gly Phe 



779 
780 
782 
784 
785 
787 
789 
790 
792 



Met 
1 

gag 
Glu 

aag 
Lys 



cac 
His 

gaa 
Glu 



gag cag 



ctg ccg 
Leu Pro 
20 

tgg gag 
Trp Glu 

35 
gca 



CDS 

(1) 
4 

tac 
Tyr 
5 

ggc 
Gly 



(1869) 



gag ate gtg ate aag gtg ccg age gac ctg gac 48 

Glu He Val He Lys Val Pro Ser Asp Leu Asp 

10 15 

att tct gac teg ttt gtg age tgg gtg gee gag 96 

He Ser Asp Ser Phe Val Ser Trp Val Ala Glu 

25 30 

ctg ccc ccg gat tct gac atg gat ctg aat ctg att 14 4 

Leu Pro Pro Asp Ser Asp Met Asp Leu Asn Leu He 

40 45 

ccc ctg acc gtg gee gag aag ctg cag cgc gac ttc ctg 192 



The types of errors shown exist throughout the Sequence Listing, 
subsequent sequences for similar errors. 
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VERIFICATION SUMMARY DATE: 05/01/2001 

PATENT APPLICATION: US/09/807 , 802 TIME: 11? 39: 03 

Input Set : Ai\sequencelist.txt 

Output Set: Ni\CRF3\05012001\I807802.raw 

L:13 M:270 C: Current Application Number differs, Replaced Application Number 
L:14 M:271 C: Current Filing Date differs.. Replaced Current Filing Date 
L:122 M:254 E: No. of Bases conflict, LENGTH : Input : 0 Counted:1027 SEQ:1 
M:254 Repeated in SeqNo=l 

L:250 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID:1 
L:255 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID:1 
L:260 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID:1 
L:265 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID:1 
L:270 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID:1 
L:275 M:336 W: 'Invalid Amino Acid Number in Coding Region, SEQ ID:1 
L:280 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID:1 
L:285 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID:1 
L:290 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ l6:Y 
L:295 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID:1 
L:300 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID:1 
L:305 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID:1 
L:310 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID:1 . 
L:315 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID:1 
L:320 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID:1 
L:325 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID:1 
L:330 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID:1 
L:335 M-336 W: Invalid Amino Acid Number in Coding Region, SEQ ID:1 
L:340 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ IDrl 
L:345 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID:1 
L:350 M:336 W: Invalid Amino Acid* Number in Coding Region, SEQ ID:1 
L:355 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID:1 
L:360 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID:1 
L:365 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID:1 
L:370 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID:1 
L:375 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID:1 
L:380 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID:1 
L:385 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID:1 
L:390 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID:1 
L:395 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID:1 
L:400 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID:1 
L:405 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID:1 
L:410 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID:1 
L:415 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID:1 
L:420 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID:1 
L:425 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID:1 
L:430 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID:1 
L:435 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID:1 
L:440 M;336 W: Invalid Amino Acid Number in Coding Region, SEQ ID:1 
L:445 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID:1 
L:450 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID:1 
L:455 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID:1 
L:460 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID:1 
L:465 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID:1 
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r. i,,y#.r ^nff'H. 




05/01/2001 
ll!39:03 



L:470 M:336 W 
L:475 M:336 W 
L:480 M:336 W 
L:877 M:254 E 
M: 254 . Repeated 
L:1207 M:254 E: 
M:254 Repeated 
L:1505 M:254 E: 
M:254 Repeated 
L:1913 M:254 E": 
M:254 Repeated 
L:2302 M:254 E: 
M:254 Repeated 
L:2624 M:254 E: 
M:254 Repeated 
L:2858 M:254 E: 
M:254 Repeated 
L:3102 M:254 E: 
M:254 Repeated 



VERIFICATION SUMMARY DATE: 
PATENT APPLICATION: 'US/09/807,802 TIME: 

Input Set : A:\sequencelist.txt 

Output Set: N:\CRF3\05012001\I807802.raw 



Invalid Amino Acid Number in Coding Region, SEQ ID:1 
Invalid Amino Acid Number in Coding Region, SEQ ID:1 
Invalid Amino Acid Number in Coding Region, SEQ ID:1 
No. of- Bases conflict, LENGTH : Input : 0 Counted: 1008 SEQ: 4 
in SeqNo=4 

No. of Bases conflict, LENGTH : Input : 0 Counted: 1008 SEQ: 6 
in SeqNo=6 

No. of Bases conflict, LENGTH : Input : 0 Counted:1008 SEQ:8 
in SeqNo^ 

No. of Bases conflict, LENGTH: Input :0 Counted:1008 SEQ:12 
in SeqNo-12 

No. of Bases conflict, LENGTH : Input : 0 Counted: 1008 £EQ:14 
in SeqNo=14 * 

No. of Bases conflict, LENGTH : Input : 0 Counted:1008 SEQ:16 
in SeqNo=16 

No. of Bases conflict, LENGTH : Input : 0 Counted:1020 SEQ:18 
in SeqNo=18 

No. of Bases conflict, LENGTH : Input : 0 Counted:1020 SEQ:19 
in SeqNo=19 
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